Rate-Distortion Analysis of Multiview Coding in a DIBR Framework
نویسندگان
چکیده
Depth image based rendering techniques for multiview applications have been recently introduced for efficient view generation at arbitrary camera positions. Encoding rate control has thus to consider both texture and depth data. Due to different structures of depth and texture images and their different roles on the rendered views, distributing the available bit budget between them however requires a careful analysis. Information loss due to texture coding affects the value of pixels in synthesized views while errors in depth information lead to shift in objects or unexpected patterns at their boundaries. In this paper, we address the problem of efficient bit allocation between textures and depth data of multiview video sequences. We adopt a rate-distortion framework based on a simplified model of depth and texture images. Our model preserves the main features of depth and texture images. Unlike most recent solutions, our method permits to avoid rendering at encoding time for distortion estimation so that the encoding complexity is not augmented. In addition to this, our model is independent of the underlying inB. Rajaei Ferdowsi University of Mashhad, Iran Sadjad Institute of Higher Education, Mashhad, Iran Tel.: +98-511-6029000 E-mail: [email protected] T. Maugey Signal Processing Laboratory (LTS4), École Polytechnique Fédérale de Lausanne (EPFL), Switzerland E-mail: [email protected] H.-R. Pourreza Ferdowsi University of Mashhad, Iran E-mail: [email protected] P. Frossard Signal Processing Laboratory (LTS4), École Polytechnique Fédérale de Lausanne (EPFL), Switzerland E-mail: [email protected] painting method that is used at decoder. Experiments confirm our theoretical results and the efficiency of our rate allocation strategy.
منابع مشابه
Efficient bit allocation for multiview image coding & view synthesis
The encoding of both texture and depth maps of a set of multiview images, captured by a set of spatially correlated cameras, is important for any 3D visual communication systems based on depth-image-based rendering (DIBR). In this paper, we address the problem of efficient bit allocation among texture and depth maps of multi-view images. We pose the following question: for chosen (1) coding too...
متن کاملAn Overview of Emerging Technologies for High Efficiency 3D Video Coding
3D video coding is one of the most popular research area in multimedia. This paper reviews the recent progress of the coding technologies for multiview video (MVV) and free view-point video (FVV) which is represented by MVV and depth maps. We first discuss the traditional multiview video coding (MVC) framework with different prediction structures. The rate-distortion performance and the view sw...
متن کاملDepth-based direct mode for multiview video coding
Multiview video plus depth sequence is considered as an efficient 3D video format for supporting advanced stereoscopic and auto-stereoscopic multiview displays. In order to encode this video format, several modes are commonly employed with rate distortion optimization technique. Specifically, direct mode is an efficient mode to encode homogeneous or stationary regions without encoding any addit...
متن کاملRate Distortion Analysis and Bit Allocation Scheme for Wavelet Lifting-Based Multiview Image Coding
This paper studies the distortion and the model-based bit allocation scheme of wavelet lifting-based multiview image coding. Redundancies among image views are removed by disparity-compensated wavelet lifting (DCWL). The distortion prediction of the low-pass and high-pass subbands of each image view from the DCWL process is analyzed. The derived distortion is used with different rate distortion...
متن کاملMultiview Video Coding Based on Global Motion Model
In this paper, we present a novel scheme for coding multiview video sequence based on global motion prediction between adjacent views. For that, the left-most view is compressed as reference sequence using standard block-based motion compensated prediction coding. And its right view is compressed with global motion prediction from the left view images. In the prediction, an eight-parameter glob...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Annales des Télécommunications
دوره 68 شماره
صفحات -
تاریخ انتشار 2013